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£\j , Abstract. We present a new study of the effects of quasar obscuration on the statistics of Damped Ly a (DLA) 

^ ' systems. We show that the extinction of any Galactic or extragalactic H I region, Ax , increases linearly with the 

column density of zinc, with a turning point dA\/d(\og Nz n ) ~ 1, above which background sources are suddenly 
obscured. We derive a relation A\ — A\(Nn,Z,z) between the extinction of a DLA system and its Hi column 
density, Nn, metallicity, Z, fraction of iron in dust, f^ e {Z), and redshift, z. From this relation we estimate the 
fraction of DLA systems missed as a consequence of their own extinction in magnitude-limited surveys. We derive 
a method for recovering the true frequency distributions of Ah and Z in DLAs, /jv h and fz, using the biased 
distributions measured in the redshift range where the observations have sufficient statistics (1.8 < z < 3). By 
applying our method we find that the well-known empirical thresholds of DLA column densities, Nz n 10 13 ' 1 
atoms cm -2 and iVm 10 22 atoms cm -2 , can be successfully explained in terms of the obscuration effect without 
tuning of the local dust parameters. The obscuration has a modest effect on the distribution of quasar apparent 
magnitudes, but plays an important role in shaping the statistical distributions of DLAs. The exact estimate of 
' the bias is still limited by the paucity of the data (~ 40 zinc measurements at 1.8 < z < 3). We find that the 

^ \ fraction of DLAs missed as a consequence of obscuration is ~ 30% to 50%, consistent with the results of surveys 

of radio-selected quasars. By modelling the metallicity distribution with a Schechter function we find that the 
mean metallicity can be ~ 5 to 6 times higher than the value commonly reported for DLAs at z ~ 2.3. 



o 
in 
o 



x 



Key words. ISM: dust, extinction - Galaxies: ISM - Galaxies: high- redshift - Quasars: absorption lines 

1. Introduction scuration bias (Ostriker & Heisler, 1984; Fall & Pei 1989, 

1993). 



Quasar absorption line systems probe the diffuse gas in the 
Universe over cosmological scales and at various stages of 
evolution. Among these absorbers, the Damped Lya sys- 
tems (hereafter DLAs) have the highest Hi column den- 
sities (iVm > 2 x 10 20 atoms cm -2 ) and are believed to 
originate in H I regions of high-redshift galaxies (Wolfe et 
al. 1986). Also, they have higher metallicities 1 than any 



The bias can lead to misleading conclusions on the 
nature of the galaxies associated with the DLA systems 
(herefater DLA galaxies) , because the H I regions most en- 
riched by metals and dust would be systematically missed 
in the surveys. In addition, some properties of the high- 
redshift Universe that can only be derived from studies of 

. , , , , , , -, -, , ,, DLAs, would also be affected by the bias. One example 

other class of quasar absorbers ( JVL/H h=a —1.1 dex; Pettini . . , ., ~ , , „ , 

, , , , • , T r is the comovmg mass density of neutral gas, Udla, and 

et al. 1994) and as such are expected to contain dust. If , ,. ... , , . . . . . ,. , r 

, , . , . „ T . ... . . . ,. its evolution with redshift, which is an indicator of gas 

present, the dust m DLAs will absorb and scatter the ra- 



diation of the background quasar, dimming its apparent 
magnitude ("extinction" effect) and changing the slope 
of its spectral distribution ("reddening"). In magnitude 



consumption due to star formation (Wolfe et al. 1995). 

Another example is the mean cosmic metallicity inferred 

from abundance studies of DLAs (Pettini et al. 1997), 

.. . , , . , . which is related to the star formation rate in the early 

limited surveys, the extinction may obscure the quasar, TT . . „ „ .. .„„- , r , n • ^ i , nn f,\ 

. ,. J ' ^. . . L . „ ^ , , Universe (Pei & Fall 1995, hereafter PF95: Pei et al. 1999). 

leading to an observational selection effect known as ob- 

Whether the obscuration effect is important, depends 
on the actual amount of dust in DLAs. The first evidence 
Send offprint requests to: G. Vladilo of dust, in the form of reddening, was reported by Pei et 

1 We adopt the usual definition [X/H] = log (X/H) - log al. (1991). The complex variability of the quasar contin- 
(X/H)q. uum makes it impossible to determine the reddening in 
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individual cases. However, from a statistical comparison 
of quasars with and without foreground DLAs one can 
search for a systematic change of the continuum slope. In 
this way, Pei et al. found a systematic difference, sugges- 
tive of reddening, using a simplified power-law fit to the 
continuum distribution. 

The detection or reddening motivated detailed studies 
of quasar obscuration. Fall & Pei (1993; hereafter FP93) 
presented an analytical method of computing the obscura- 
tion from the observed luminosity function of quasars, the 
typical dust-to-gas ratio of DLAs and the empirical dis- 
tribution of H I column densities. By modelling the radial 
distribution of the neutral gas in DLA galaxies, they con- 
cluded that a large fraction of quasars may be obscured 
in optically selected samples. The large uncertainty of the 
reddening measurements prevented reaching firm conclu- 
sions on the dust-to-gas ratios and on the magnitude of 
the bias. 

An alternative approach to measure the amount of the 
dust in DLAs consists in measuring the differential de- 
pletions between refractory and volatile elements (Pettini 
et al. 1994; Hou et al. 2001; Prochaska & Wolfe 2002; 
Ledoux et al. 2003; Vladilo 2002, 2004). The depletions 
can be converted into dust-to-gas ratios, yielding an ap- 
proximate estimate of the quasar extinction (Vladilo et 
al. 2001b; Prochaska k Wolfe 2002). This method, as well 
as the reddening measurements, only give information on 
the detected DLAs, leaving open the possibility that more 
dusty H I regions remain undetected. 

A more direct probe of the obscuration effect is a com- 
parison of DLA statistics in optically selected and radio se- 
lected quasar samples, given that the latter are unaffected 
by dust extinction. Ellison et al. (2001) compiled a ho- 
mogeneous sample of radio selected quasars and searched 
for DLAs towards every target, irrespective of its optical 
magnitude (CORALS survey). They concluded that dust- 
induced bias in previous magnitude limited surveys may 
have led to underestimating the H I mass density, JIdla , 
and the number density per unit redshift interval, n{z), 
of DLAs by at most a factor of two at z ~ 2.3. In ad- 
dition, they found tentative evidence that n[z) is greater 
in fainter quasars, as expected by the obscuration effect. 
In a follow-up study of CORALS metallicities, Akerman 
et al. (2005) find [< Zn/H >] = -0.88 ±0.21. This value 
is higher than in a control sample, as expected by the 
obscuration, but only at la level. In an extension of the 
CORALS survey at 0.6 < z < 1.7 Ellison ct al. (2004) 
find n(z) = 0.16±°;°* using Mgll absorbers as DLA can- 
didates. This value is consistent with magnitude-limited 
estimates at the same z, but the la error permits a factor 
of 2.5 difference in the sense predicted by the obscuration. 

The results of the CORALS survey are not yet conclu- 
sive since the original sample of Ellison et al. had limited 
statistics, in particular at the high values of AT(Hi) where 
the effect is expected to be more important. This weak ev- 
idence of obscuration, together with the non-detection of 
reddening in a large sample of quasars with and without 
candidate DLAs, recently reported by Murphy & Liske 



(2004) , are conveying the impression that the obscuration 
effect is not important. 

Yet, indirect evidence for the existence of the bias 
comes from other studies. The analysis of [Zn/H] versus 
log V(Hi) by Boisse et al. (1998) showed that none of the 
DLAs with zinc detection known at the time have a metal- 
licity above the threshold [Zn/H] + log N(U i) ~ 20.5, cor- 
responding to V(Znii) ~ 10 13 ' 1 atoms cm~ 2 . They pro- 
posed that this threshold could be attributed to obscura- 
tion effect, assuming that metal column density tracks the 
dust column density. 

While this assumption has not been proven by sub- 
sequent investigations, the threshold proposed by Boisse 
et al. has been invoked to reconcile several predictions of 
galactic evolution models (Prantzos & Boissier 2000; Hou 
et al. 2001) and cosmological simulations (Cen et al. 2003) 
with the observed properties of DLAs. 

Another potential evidence of the bias is the lack of 
DLAs with Hi column densities V(Hi) ^ 10 22 atoms 
cm~ 2 . In this case, the obscuration has been invoked to 
explain the significant fraction of model galaxy disks pre- 
dicted to have higher H I column densities (Churches et al. 
2004) . As an alternative explanation, it has been proposed 
that hydrogen may undergo a sudden transition from the 
atomic to the molecular form above a critical column den- 
sity threshold (Schaye 2001), in which case the absorber 
would not be detected as a DLA system. 

In the present work we address several open questions 
concerning the existence and importance of the obscura- 
tion bias. In Section 2, we perform a careful investiga- 
tion of all the factors that determine the extinction of a 
Galactic or extragalactic H I region. In Section 3 we report 
some indirect evidence of obscuration effects. In Section 
4 we present a mathematical formulation aimed at recov- 
ering the frequency distributions of Hi column densities 
and metallicities of DLAs from the observed distributions 
affected by obscuration bias. The first implementation of 
this method is described in Section 5 and the results are 
discussed in Section 6. 



2. The relation between Nz n and extinction 

We adopt the simplified notation Nz n = JV(Znll) and 
Nh — iV(Hl), ignoring contributions from ionization 
states other than the dominant one (see Vladilo et al. 
2001a and references therein). 

We start by considering a refractory and a volatile 2 
element, E r and E v , with interstellar abundance ratio by 
number (E r /E v ). The ratio of the total column densities 
(gas plus dust) of these two elements along an interstellar 
line of sight will be N^/N^ = (E r /E v ). We call N& and 
Ve the column densities of an element E in the dust and 
in the gas, respectively. The fraction of atoms of E in dust 
form will be /e = N^/N^ 1 . From these definitions we 

2 Refractory elements easily condense into dust form, while 
volatile elements tend to remain in the gas phase. 
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have Nf ot = iV r d // r and A^ ot = N v /(1 - / v ). Combining 
these relations we have 

(l-/v) Uv 

We next consider the relation between Njr and the col- 
umn density of dust grains, Ngr, i.e. the number of grains 
per cm 2 along the line of sight. For spherical grains of ra- 
dius r gr (cm) and internal mass density g gI (g cm ) it is 
easy to show that 



W r d A r m H = - it r 3 r g gI N gI X? , 



(2) 



where A r is the atomic mass of the refractory element and 
Xf the mass fraction in the dust of the same element. 

Finally, we consider the relation between the interstel- 
lar extinction in magnitudes at the wavelength A, A\, and 
the column density of dust grains. For spherical grains 
with extinction efficiency factor Q e ,\ (see Spitzer 1978, 
Chapter 7) this relation is 

A x = 1.086 N gI Q e)X nr 2 gr . (3) 

By combining Eqs. Q, J2J and J3J) we derive 
A x =a x N v , (4) 
where we dehne 

/r [ E r 



"A 



1.817 x 10"^ A r g x 



(l-/v) VE 



and we isolate in the term 



3A 



' gr 



Q gI X? 



(5) 



(6) 



all the properties of the grains (shape, extinction ef- 
ficiency, size, density, and composition). For spherical 
grains s = 3/4. 

The factor / r /(l — / v ) is approximately constant by 
definition of volatile and refractory elements. In a medium 
of constant composition, such as the local ISM, also the 
abundance (E r /E v ) is constant. Therefore, for a given type 
of dust, the term a\ is constant and the expression Q 
implies that the interstellar extinction increases linearly 
with the gas-phase column density of any volatile element 
measured in the same line of sight. 

Differences in the shape, size, density, composition and 
extinction efficiency of the grains will induce a scatter in 
the relation. The dependence of g\ on the grain size r gr is 
linear, while in Eqs. and it was cubic and quadratic. 
Since the grain size distribution is a key factor in deter- 
mining the wavelength dependence of the interstellar ex- 
tinction (see e.g. Draine 2003), the mild dependence of g\ 
on r gr suggests that the relation between A\ and N v may 
be similar even in interstellar regions with different types 
of extinction curves. 

By adopting zinc as the volatile element and consid- 
ering the extinction in the V band (Xy = 0.55^m), we 
obtain from the relation 
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Fig. 1. Extinction Ay = Ry Eb-v ( ma g) versus Znn col- 
umn density (atoms cm~ 2 ) in Galactic interstellar lines of 
sight. Filled diamonds: HST data from Roth & Blades 
(1995). Empty diamonds: IUE data from Van Steenberg 
& Shull (1988). Curve: extinction law (JJJ with ay — 
0.29 x 10~ 13 mag cm 2 derived empirically as explained 
in the text. 

where the parameter ay (mag cm 2 ) can be determined 
by comparing measurements of Ay and Nz n in individual 
lines of sight. In this way it is possible to measure the 
scatter of ay due to variations of the grain properties. 

To estimate ay we gathered the Nz n measurements 
obtained from HST (Roth & Blades 1995) and IUE (Van 
Steenberg & Shull 1988) high-resolution spectra. The IUE 
data are more numerous, but have lower resolution and 
signal-to-noise ratio. To reduce the impact of these dif- 
ferences we selected the IUE measurements with total er- 
ror < 0.3 dex in logA'zn. To minimize the contamina- 
tion by stellar absorptions on top of the IUE Zn n inter- 
stellar lines (Van Steenberg & Shull 1988), we selected 
only IUE spectra of fast-rotating stars (Vsini > 200 km 
s _1 ). We then scaled the IUE column densities to match 
the Bergeson & Lawler (1993) Znll oscillator strengths 
adopted in the HST data and in the current research. As 
far as the extinctions are concerned, we used the Ay val- 
ues given by Guarinos (1991). As a result, we obtained 
a sample of 11 lines of sight. The mean value of extinc- 
tion per unit zinc column density that we derive is <ay> 
= 0.27(±0.06) 10~ 13 mag cm 2 and 0.38(±0.20) 10~ 13 mag 
cm 2 for the HST and IUE sub-samples (4 and 7 lines of 
sight, respectively). 

In order to enlarge the sample we also considered 
the color excess Eb-v, which is more commonly mea- 
sured than Ay. The ratio of general-to-selective extinc- 
tion, Ry = Av/Eb-v, is approximately constant, with a 
mean value <Ry> ~ 3.1 in the diffuse ISM of the Milky 
Way (see e.g. Draine 2003). We used this property to de- 
rive a mean value <ay> = <Ry> <Eb-v /Nz a > from red- 
dening and zinc column density measurements. The Eb-v 
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data were taken from the extinction catalog of Savage et 
al. (1985). The resulting sample includes 24 lines of sight, 
shown in Fig. ^ The mean value of the HST sub-sample 
(9 lines of sight) is < ay > = 3.1 x 0.089 (±0.030) 1CT 13 
mag cm 2 , and of the IUE sub-sample (15 lines of sight), 
3.1 xO. 123 (±0.071) 10~ 13 mag cm 2 . We did not find signif- 
icant correlations between <Eb-v / Nzn> and the param- 
eters of UV extinction provided by Savage et al. (1985), 
such as the strength of the emission bump or the steepness 
of the UV rise. The lack of these correlations gives us a 
justification for averaging the results obtained from lines 
of sight with different types of dust. 

The results obtained from the Eb-v analysis are iden- 
tical to those obtained from the more limited sample 
of direct measurements of Ay. The larger dispersion of 
the IUE sub-sample is probably due to the lower qual- 
ity of the data. We adopted the weighted average of 
the IUE and HST sub-samples, which yields < ay > = 
0.29(±0.07) 10 -13 mag cm 2 , a value almost identical to 
that of the HST sub-sample. The relation JZJ obtained 
from this estimate of ay is displayed in Fig.n(solid curve). 
The exponential rise of the extinction is due to the fact 
that the Znii column density is plotted on a logarithmic 
scale. In this scale the rise of the extinction is mild at low 
values of log7Vz n , but very fast at high values. The turning 
point marking the transition between these two regimes 
can be estimated from the condition dAy /d{k>g Nz a ) = 1< 
This criterion yields logiVzn = — log(< ay > In 10) = 
13.18. Above this turning point the extinction undergoes a 
dramatic rise, acting as a barrier against detection of faint 
background sources. This barrier is remarkably similar to 
the detection threshold proposed by Boisse et al. (1998). 
This motivated us to investigate the relation between ex- 
tinction and metal column density in DLA systems. 

2.1. Extinction versus Nz n in DLA systems 

The relations J2J , © and © are valid in the ISM of any 
galaxy, including DLA systems, which are Hi regions of 
high-rcdshift galaxies. Applying these relations to a DLA 
system and to the local ISM, and adopting E v =Zn, E r =Fe 
and A = \y , we obtain 



A\ 



ay i 



9v (l-/z n) i) /fc (Fe/Zn) 



gv,i 



/Fe,i 



(1-fzn) (Fe/Zn)* 



^Zn , (8) 



where the quantities in the local ISM are indicated with 
the index i and those in the DLA system are not la- 
belled. To take into account the dependence on metallicity, 
which varies among DLAs, we introduce the metallicity 
Z = N^/Nn, where N%£ = N Zn /(l - f Zn ) is the total 
zinc column density (gas plus dust). In this way we obtain 
the expression 



Ay ~ A G /fc 



(Fe/Zn) 
(Fc/Zn) 



Z Nh 



(9) 



where A Q = a Vji (1 - fzn,i) / fp e ,i and G = gv/gv.i. The 
constant A Q is estimated using the value <ay> derived in 



the previous section and the mean values of dust fractions 
of the local ISM sample (/ ZlM = 0.40 and / Fe ,i = 0.94). 
We obtain A Q ~ 1.85 x 10~ 14 mag cm 2 , with a relative 
error of ~ ±0.15 dex including the uncertainties of the 
dust fractions. The DLA extinction will vary according to 
the Fe/Zn abundance in the medium, the iron depletion 
and the properties of the grains. We briefly discuss each 
one of these factors. 

Relative abundances of metals do not show strong vari- 
ations in DLAs, despite significant changes of the abso- 
lute metallicity Z. The Fe/Zn ratio is very close to solar 
in metal-poor stars with metallicities typical of DLA sys- 
tems (Mishenina et al. 2002; Gratton et al. 2003; Nissen 
et al. 2004; see discussion in Vladilo 2004). Deviations 
from the solar ratio are measured in stars with extremely 
low metallicities. Such deviations do not affect the present 
work in any case, because the dust fraction and the ex- 
tinction vanish at very low metallicity, as we now discuss. 
We adopt therefore (Fe/Zn)/(Fe/Zn) = 1. 

The depletion of DLAs increases with metallicity 
(Ledoux et al. 2003). In particular, the dust fraction of 
iron undergoes a fast rise between between metallicities 
[Zn/H] ~ — 2 dex and —1 dex, from values close to zero 
up to values typical of the Milky- Way warm interstellar 
gas (Vladilo 2004). Here we model this trend with the 
analytical expression 



1 



h c {Z) = - 



arctan 



[Zn/H] - [Zn/H] c 
A([Zn/H]) 



designed to smoothly fit the ISM and DLA data, providing 
a gradual decrease to zero at very low metallicity. 

The dependence on the changes of the grain prop- 
erties enters through the factor G, which scales as 
sQ e ,x (?"gr Pgr^Fe) -1 O^l- EJ- This term is mostly deter- 
mined by the physical conditions of the medium, rather 
than by its chemical enrichment, with the possible excep- 
tion of the abundance by mass of iron in the grains, Xp c . 
At the very early stages of chemical evolution we may ex- 
pect variations of Xp c as a consequence of non-solar rela- 
tive abundances of metals. However, these effects do not 
affect our estimate of the extinction since, at the very low 
metallicities typical of these early stages (Z <C Zq), the 
factor /fc(zT) vanishes in any case, yielding a null extinc- 
tion. 

The variations of G due to changes of the physical con- 
ditions can be assessed by sampling different regions in a 
medium of constant composition, such as the local ISM. 
The low scatter of ay derived in the previous section in- 
dicates that the scatter of G in the local ISM is low, even 
if the MW sample includes lines of sight with different 
physical conditions. To explain this qualitatively, we can 
imagine that variations of the grain size r gr may compen- 
sate variations of the dust density g gr . For instance, in a 
harsh interstellar environment, volatile elements tend to 
leave the grains and we may expect an increase of g gI be- 
cause carbon, the element potentially most abundant in 
the grains, is volatile and has low atomic mass. At the 
same time, we may expect that larger grains will be more 
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Fig. 2. Extinction of a quasar at a fixed wavelength in 
the observer's frame, plotted versus the redshift of an in- 
tervening DLA system with = 10 1350 cm -2 . Solid 
and dashed curves represent MW-type and SMC-type ex- 
tinction curves, respectively (Cardelli et al. 1998; Gordon 
et al. 2003). Curves left to right: photometric bands u', g', 
and r' , respectively. 



easily destroyed in a harsh environment and the mean r gr 
will decrease, countering the increase of g gT in Eq. |JB}. 

These general considerations apply to any type of in- 
terstellar environment, suggesting that G may not vary 
dramatically in DLAs with moderately low abundances 
(Z ^ 0.1 Zq). To test this hypothesis we computed G in 
the Small Magellanic Cloud (SMC) from the expression 



Gsmc — 



(^)SMC 



'A 
■ N 
( Ay 



■>SMC 



/Fe.SMC 



1vh)mw 

applying 



Zp) /Fe.MW 



(10) 



Eq. © to the 
(Fe/Zn) . Inserting (^)smc 



SMC 



with 
,7 x 



^ 2 (Gordon 2003; SMC bar), (^)mw 



obtained by 
(Fe/Zn)sMC 
10 -23 mag cm 
2.1 x 10~ 22 mag cm 2 (Bohlin et al. 1978), Z SMC ~ 
0.25 Z Q (Russel & Dopita 1992), and /fc,smc(^SMc) - 
0.91 /fc,mw from our analytical relation /p c = f-p e (Z), we 
obtain Gsmc — 0.6. Considering the uncertainties in this 
derivation, this result is consistent with the Milky- Way 
value, suggesting that, indeed, the term G may not vary 
dramatically in galaxies of moderately low abundances 
even if the extinction curve is different. 

Eq. is valid in the rest frame of the Hi region. 
To derive the extinction in the observer's frame we need 
to consider the wavelength dependence of the extinction. 
In the rest frame of the DLA system the extinction at 
the wavelength A' will be Ay ~ £(A') Ay, where £(A') = 
A(X')/A(Xy) is, by definition, the normalized extinction 
curve. In the observer's frame the same extinction will 
appear at A = A' (1 + z), where z is the absorption redshift. 



In conclusion, the observer's frame extinction of a 
quasar at wavelength A due to an intervening DLA system 
at redshift z is 



A x (N H ,Z;z)~A G £ 



A 



1 + z 



f Fe (Z) TVh Z 



(11) 



with A ~ 1.85 x 10~ 14 mag cm 2 , and G = 1 and 0.6 for 
Galactic- type and SMC-type dust, respectively. For each 
type of dust we adopt the corresponding extinction curve 
representative of the Milky Way and of the SMC. 

Since £(A') > 1 for A' < Ay, the net effect of the 
cosmological redshift is an amplification of the rest-frame 
extinction at A < Ay (1 + z). At z il 2 this amplifica- 
tion compensates for the reduction of fpc(Z) in systems 
of moderately low metallicity and for the mild reduction 
of G in SMC-type dust. Only at very low metallicity the 
extinction becomes negligible since fF C (Z) tends to van- 
ish. Overall, the extinction per unit column density of zinc 
atoms predicted for most DLAs is similar, or even larger, 
than that typical of the Milky- Way ISM. 

Examples of quasar extinction estimated with Eq. 111(1 
are plotted in Fig. [3 as a function of z, keeping fixed the 
other terms. This type of representation puts in evidence 
the amplification of the extinction with increasing red- 
shift. Each curve was computed at a constant A equal to 
the effective wavelength of the SLOAN Digital Sky Survey 
(SDSS) photometric bands u', g' and r' (Fukugita et al. 
1996). We adopted the extinction law by Cardelli et al. 
(1998; hereafter CCM law) with R v = 3.1 for the Milky- 
Way extinction curve and the mean SMC bar data by 
Gordon et al. (2003) for the SMC curve. The term f Fc (Z) 
was estimated at [Zn/H] = log(Z/Zo) = —1 dex. The 
total zinc column density was fixed at — Ah Z = 

10 13 50 atoms cm~ 2 , roughly two times the zinc column 
density threshold proposed by Boisse et al. (1998). For 
the redshift interval typical of most DLAs (z w 2/3) the 
predicted extinction is > 1 mag, a value sufficient to ob- 
scure quasars in spectroscopic surveys. 

3. Evidence for extinction from DLA data 

To search for some extinction effect related to the zinc col- 
umn density we collected from the literature all DLAs with 
available N% n measurements. The resulting sample, shown 
in Fig. |21 includes 41 measurements, most of which derived 
from Hires/Keck and UVES/VLT spectra. References to 
these data can be found in Vladilo (2004; table 1), with 
the exception of 4 systems (without Fe n lines) published 
by Pettini et al. (1994, 1997) and Boisse et al. (1998). 

The updated sample includes more than twice the mea- 
surements of the one discussed by Boisse et al. (1998) but, 
in spite of this increase, column densities above the orig- 
inal threshold (line AB in the figure) are not found. The 
lack of DLAs with Nz n > 10 13 ' 1 atoms cm~ 2 is surprising, 
since Milky Way lines of sight with Znii measurements 
do show values above this limit in a significant fraction of 
cases (Fig. |3J). Studies of gamma-ray bursts (GRBs) ob- 
served inmediately after the explosion demonstrate that 
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Fig. 3. Diamonds: column densities Ah versus Nz n in 
DLAs. Horizontal lines: DLA definition threshold (bot- 
tom) and empirical upper boundary (top) of Ah- Vertical 
lines: detection limit (left) and obscuration threshold 
(right) of Nzn- Dashed lines: lines of constant metallic- 
ity. Circles: Milky Way data of Fig. 1. Star: GRB-DLA 
from Savaglio et al. (2003). 



zinc column densities above the threshold exist and can 
be detected in extragalactic absorbers 3 (Savaglio et al. 
2003; the only case with measured Ah is indicated with a 
star in Fig. 01 . 

These results confirm that some selection effect is pre- 
venting the detection of values of Az n above the threshold 
when the background source is faint. The remarkable sim- 
ilarity between the DLA cutoff at Az n = 10 131 atoms 
cm -2 and the ISM turning point at Nz n — 10 132 suggests 
that the rapid rise of the extinction above the turning 
point can be responsible for the cutoff. This is consistent 
with the conclusion of the previous section that the ex- 
tinction per unit zinc column density of many DLAs is 
similar to that of the local ISM. 

If the extinction increases with Az n , we expect to find 
evidence of this effect also below the threshold, in the 
range log Az n 13.1 where we do detect DLAs. To search 
for this evidence we analysed the frequency distribution of 
quasar apparent magnitudes of the zinc sample. We used 
the V magnitude, which is measured in most cases. In 4 
quasars without V measurements we adopted the magni- 
tude of nearby optical bands. In practice, we compared 
the behaviour of the two sub-samples with logAz n be- 
low and above the median value, (log Az n ) median = 12.53. 
The differences between the two distributions, shown in 
Fig. 01 are consistent with a rise of the quasar extinction 

3 Absorbers detected in front of GRBs, however, may have 
different properties from classical DLAs; a claim for gray ex- 
tinction in GRB/DLAs has been made by Savaglio et al. 
(2003). 



Fig. 4. Normalized frequency distribution of quasar ap- 
parent magnitudes for the sub-sample of DLAs with 
logiVzn > 12.53 (shaded histogram) and logAz n < 12.53 
(light histogram). 

with increasing Az n . The analysis of the three bins with 
more statistics indicates that the maximum of the sub- 
sample with high Nzn (shaded histogram) is shifted by 
« 1 bin relative to the maximum of the other sub-sample 
(light histogram). A Kolmogorov-Smirnov test shows that 
there is only a 9.7% level of probability that the two dis- 
tributions are drawn from the same parent population. 
The shift to fainter magnitudes suggests that even at 
logAz rl < 13.1 dust extinction is already present, affect- 
ing more the high- Az n sub-sample. The magnitude of this 
effect can only be explained by invoking a redshift am- 
plification of the extinction, as predicted by relation (|11|) , 
since the extinction expected at rest-frame is only ~ 0.35 
mag at log A Zn = 13.1 (Fig.HJ. 

4. Mathematical formulation of the bias 

In Eq. 1)1 l[l the variation of the depletion in DLAs is ac- 
counted for by the term fp e (Z), which is determined by the 
metallicity. The term G is approximately constant, at least 
in galaxies with metallicity similar to those of the SMC 
and Milky Way. Possible variations of G in galaxies of very 
low metallicity do not affect the prediction of the extinc- 
tion since, in any case, f-p e {Z) vanishes when Z <C Zq. 
For these reasons, relation 1)11)1 allows us to estimate the 
extinction of a DLA system using, in practice, only its Hi 
column density and metallicity. We use this property to 
derive a mathematical formulation of the obscuration bias 
based on the study of the distribution functions of Ah and 
Z in DLA systems. In practice, we determine the fraction 
of DLAs that are missed as a consequence of their own ex- 
tinction. We then assume that multiple DLA absorbers in 
a given line of sight play a negligible role in the estimate 
of the obscuration bias. In this way, as we show in the 
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Appendix, we derive a mathematical formulation of the 
bias which only makes use of observable statistical distri- 
butions of DLAs and quasars. To quantify the obscuration 
effect we need a good statistics of the observed distribu- 
tions and this requirement limits the redshift range where 
our method can be presently applied, since the bulk of 
DLAs data are currently concentrated in 1.8 < z < 3.0. 

An advantage of our formulation is that it does not 
require a knowledge of the intrinsic luminosity function 
of quasars or the geometrical distribution of the neutral 
gas in DLA galaxies. We refer to FP93 for a mathemat- 
ical treatment of the effect of quasar obscuration which 
describes the relation with the quasar luminosity function 
and the geometrical distribution of the gas in the inter- 
vening galaxies. 



In deriving these equations we made the assumption 
that the distributions of Ah and Z are statistically inde- 
pendent. We discuss this assumption in Section 4.4. 

4.1.1. The quasar magnitude distribution 

We call n(m; z) dm the number of quasars with apparent 
magnitude to 6 (to, to + dm) observable beyond redshift 
z all over the sky. This number is defined in absence of 
quasar obscuration and we call n(m; z) dm the "true" dis- 
tribution. Our goal is to infer the true distribution from 
the observed one, which must equal the distribution biased 
by the obscuration effect, n h (m; z). The relation between 
true and biased distribution is discussed in the Appendix 
(Section A. 2), where we derive the equations Eq. I)A.19|1 . 



4.1. True and biased distributions 

We consider the DLAs in the redshift interval (z m i n , z max; 
that are detectable in a survey with limiting magnitude 
me. We call /at h cZAh the number of such DLAs with col- 
umn densities between Ah and Ah + c?Ah and fz dZ the 
number of those with metallicity between Z and Z + dZ. 
The distributions /n h and fz are defined in absence of 
obscuration bias and we call them the "true" distribu- 
tions. When we consider the effect of the extinction gen- 
erated by these DLAs, the number of detectable systems 
will be fN H dN H and f%dZ. We call /& H and f\ the "bi- 
ased" distributions. In the Appendix we derive the follow- 
ing relations between true and biased distributions [see 
Eqs. EUl 



and 



b J~B mt (N H ,Z)fzdZ 

JNn — r oo j. — — JNh 



and 



Jo^fzdZ 



J^ A B me (N H ,Z)f NH dN H 



(12) 



(13) 



For simplicity we omit the dependence of the equations 
on z m i n and z max - We assume that the statistical distribu- 
tions of interest smoothly vary inside the redshift interval, 
so that we can approximate them with their value at mean 
redshift z — (z max — z m i n )/2, where z is close to the peak 
of the observed distribution. 

The "bias function" B mi (Ah, Z) = B mt (A H , Z, z) that 
transforms the true distributions into the biased ones is 
given by the relation [see Eq. (|A.5|) ] 



B me {N Hl Z, z) 



= f™*- A >( N «' z <*) n (m;z)dm 
J o " n(m; z) dm , 



(14) 



where A\{N^ 1 Z, z) is the DLA extinction and n(m; z) the 
quasar magnitude distribution. 

In the next section we show how to estimate n(m; z), 
while in Section 4.3 we present a procedure for deriving 
the true distributions from Eqs. I|12fl and (|13fl . starting 
from the observed distributions. 



n h (m; z) ~ [l — JFi] n(m; z) + T\ n\(m; z) , (15) 
and Eq. (IA.20j) . i.e. 

/>oo 

n\{m; z) = K I n(m';z) $ aSjti — m';z e ) dm' . (16) 



In these relations T\ is the fraction of quasars with one 
foreground DLA, ?i^(to; z) the magnitude distribution of 
quasars with one foreground DLA, if is a normalization 
factor and fA x (A\;z e ) the frequency distribution of the 
extinctions of the DLAs in the redshift range z G (0,z e ) 
(z e is the mean redshift of the quasars in the survey). In 
Section 4.3, we show how to use Eqs. I|15l) and l|l(jD to infer 
n(m; z) (see also Appendix A.2). 

4.2. Obscuration fraction 

We define the obscuration fraction using the mathematical 
relations between the true and biased distributions. We 
call total obscuration fraction the quantity 



(17) 



The total obscuration fraction represents the number ratio 
of DLAs missed due to quasar obscuration in a survey with 
limiting magnitude mg. The dependence on the limiting 
magnitude enters in /Jy through Eqs. (|A.5|I and l|A.12|) . 
The total obscuration fraction can also be derived using 
the true and biased distributions of metallicity. 

We call obscuration fraction, <f> mi , the fraction by num- 
ber of systems obscured at a particular value of column 
density or metallicity. For instance, the fraction of systems 
obscured at a specific value of Ah is 



; (^h) = 1 



(18) 



Similar expressions can be derived to define the obscu- 
ration fractions 4> me (Z) and <^(A| n ), i.e. the fraction of 
systems missed at a particular value of Z and A| n . 
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4.3. Solution of the equations 

The true functions /jv h and fz are the unknown quan- 
tities in Eqs. I(12|l and (JT3J. The procedure that we fol- 
low to solve these equations consists in (i) adopting trial 
functions /at h and fz, (ii) estimating n m = n(m;z), (iii) 
computing and f\ from the equations and (iv) com- 
paring these biased distributions with the observed ones. 
These steps are repeated by changing the trial functions 
until the predicted biased functions match the observed 
ones. We briefly describe each step of this procedure. 

In the first step, we adopt analytical functions in para- 
metrical form as the trial functions /at h and fz- Once we 
choose the shape, the problem of determining the true 
functions /jv h and fz is equivalent to that of determining 
the values of their parameters. 

In the second step we use the trial functions /jv h and 
fz to derive the distribution of extinctions from Eq. 1)11(1 . 
We use this extinction distribution in Eq. 1)16(1 . We then 
iterate Eqs. 115|) and i(16|) . starting with a trial function 4 
n(m; z), until we find a n(m; z) that yields a perfect agree- 
ment between n h (m;z) and the distribution n° hs (m;z) 
measured from the surveys. 

In the third step we use n(m;z) to derive B me (Nn, Z) 
from Eq. ((14(1 and, finally, the biased distributions /Jj- 
and f\ from Eqs. © and 

In the last step we compute the % 2 deviation between 
the predicted distributions and f\ and the corre- 
sponding observed distributions. 

By iterating the steps we determine the parameters of 
/at h and fz which allow f\ and f\ to best fit the empir- 
ical distributions. The search is done via \ 2 minimization. 
Since we are not guaranteed a priori of the unicity of the 
solution, we carefully investigate the variation of x 2 in the 
parameter space to make sure the minimum is unique. 

4.4. Are and Z statistically independent ? 

The mathematical formulation presented above relies on 
the assumption that Nu and Z are statistically indepen- 
dent of each other in the true population of DLAs. The 
local ISM offers an example in support of this indepen- 
dence, since a broad range of Hi column densities, includ- 
ing the high values typical of DLAs, are found at constant 
metallicity, Z ~ Zq. Similarly, a DLA system of given 
metallicity Z ~ Z will display a distribution of H I col- 
umn densities, if sampled along random lines of sight. This 
distribution will reflect the mass spectrum of interstellar 
clouds, which is likely to be determined by some physical 
mechanism, such as interstellar turbulence (see discussion 
in Khersonsky & Turnshek 1996 and refs. therein), rather 
than by the metallicity. In this sense, it is reasonable to 
assume that /jv h and fz are independent distributions. 

Independent support of these arguments comes from 
the hydrodynamic simulations of DLAs evolving in ACDM 

4 The observed distribution n h (m;z) can be used as the 
starting trial function of n(m; z). 



cosmology (Cen et al. 2003), which indicate that metal- 
licity and hydrogen column density are almost completely 
uncorrelated. Similar results are found from the cosmolog- 
ical SPH simulations performed by Nagamine et al. (2004). 

Notwithstanding, a trend is expected because the 
metallicity correlates with the star formation rate, which 
in turn increases with the gas density in the host galaxy. 
Therefore, lines of sight with high jVh may have a ten- 
dency to be associated with higher metallicities. A weak 
evidence for this trend was found by Cen et al. (2003), but 
the effect is particularly small at redshift z ~ 2 — 3. If the 
metallicity is correlated with Nu, the effect of the obscu- 
ration is more important than in the ideal case considered 
above. In this case the application of our mathematical 
formulation gives a conservative estimate of the bias. 

4.5. Gravitational magnification 

Competing with the obscuration effect, is the possible 
gravitational magnification of the background quasar by 
the absorber itself. Many studies over the years have at- 
tempted to estimate the magnitude of this phenomenon 
for both DLAs (Smette et al. 1997, Le Brun et al. 2000) 
and metal absorption lines (Vanden Berk et al. 1996, 
Menard & Peroux 2003). The results show that for the 
DLAs selected at optical wavelengths, gravitational effects 
are small. In the present application of our method, based 
on DLAs selected at optical wavelength, we therefore ne- 
glect the magnification effect. 

5. Implementation of the procedure 

We describe the statistical distribution functions adopted 
here as a first example of implementation of the proce- 
dure. For the intrinsic distributions we only need to adopt 
a functional dependence (i.e. a shape), leaving to the pro- 
cedure the task of determining the parameter values. 

5.1. The empirical distributions 

We consider the sub-sample of DLA systems with avail- 
able measurements of Znii column densities, for which it 
is possible to derive metallicities corrected for depiction ef- 
fects (Vladilo 2004). The differences between the corrected 
metallicities and the [Zn/H] values taken at face value are 
small, with a mean value of +0.12 dex. Given the size of 
the zinc sub-sample (~ 40 systems; see list of references 
in Table 1 of Vladilo 2004) we can estimate the frequency 
distribution of metallicities with a Poisson statistical er- 
ror 10% in only a few bins. To make the best use of the 
available data we selected the systems in the redshift range 
1.8 < z < 3, where most of the N% n measurements have 
been obtained so far. The mean redshift of the sample (28 
systems) is z ~ 2.3. To derive the empirical distribution 
of Nh w e used the surveys by Storrie-Lombardi & Wolfe 
(2000) and Peroux et al. (2003), which have more statis- 
tics than the zinc sub-sample. Selecting the DLAs in the 
range 1.8 < z < 3 we obtained a sample of 59 systems. 
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Then logiV(Hl) and [Zn/H]=log(Z/Z Q ), we trans- 
formed the derived distributions in linear space, A(Hi) 
and Z, for later comparison with the distributions /h and 
fz- In computing the biased distributions with our proce- 
dure, we took into account the fact that the typical limit- 
ing magnitude of the Nr surveys (me ~ 19.5) is somewhat 
larger than that of the metallicity surveys (me ~ 19), ow- 
ing to the different requirements in spectral resolution. In 
fact, Eqs. (|A.12I) and (|A. 13|) can be solved independently 
of each other, inserting in each case the appropriate value 
of me- 

5.1.1. The frequency distribution of quasar magnitudes 

To derive the empirical distribution of quasar magnitudes 
we used the data of the Sloan Digital Sky Survey (SDSS) 
(Schneider et al. 2003). The distribution was determined 
for two photometric bands well representative of the visual 
part of the spectrum, namely the g' and r 1 bands, with ef- 
fective wavelengths A = 0.48/im and 0.62/im, respectively 
(Fukugita et al. 1996). In each band we binned the data at 
steps of 0.5 mag, counting all the quasars located beyond 
the typical absorption redshift of our sample (in practice, 
we adopted 2 < z e < 5). An example of empirical distribu- 
tion computed for the g' band is shown in Fig. [S] (diamonds 
with error bars). After binning the data, we performed a 
polynomial fit to the observed log n - log m distribution, 
up to the magnitude 21.5, for which the statistics and the 
completeness of the SDSS sample are still good. This fit 
was adopted as a smooth model of the observed distribu- 
tion in the magnitude range 17 < m < 21.5. An example 
of fit is shown in Fig. [3] (solid line). The limit m ~ 21.5 
is sufficient for studying the effects of quasar obscuration 
on the statistics on DLAs since the typical limiting mag- 
nitude of DLA surveys is currently me < 21. 



5.2. The shape of the true distribution 

One way to approximate the true shape of /jv h is to 
assume a geometrical distribution of the gas in DLA 
galaxies. For planar disks with radial exponential pro- 
files N±(r) — N±oexp(—ar), the expected frequency 
distribution is Jn h oc N^ 1 lnA^ for Afa <C N± and 
f Nil oc A H 3 for A H > N_i_ (FP93). For central column 
densities ATj_o <; 3 x 10 21 cm -2 or much larger, as inferred 
by FP93, this gives /n h oc A^ 1 In Ah for most of the 
column-density range typical of DLAs. 

A realistic approximation of /a? h should also take into 
account the fact that, in a given galaxy, the interstel- 
lar clouds show a spectrum of masses, m c , and sizes, r c , 
which is not accounted for by a smooth radial profile. 
A large number of studies indicate that the mass spec- 
trum of Milky- Way interstellar clouds follows a power law 
f(m c ) ~ mj, with 7 = —1.5 ± 0.2 over a 6 orders of 
magnitude range of masses (Scalo & Lazarian 1996 and 
refs. therein). From this mass spectrum, together with a 




18 2 
m (mag) 



Fig. 5. Diamonds: distribution of the SDSS g' apparent 
magnitudes for quasars in the interval 2 < z e < 5. Smooth 
lines: example of a pair of model distributions in which 
the "true" model (dashed line) is determined from the 
biased model that fits the observations (solid line), once 
the distribution of DLA extinctions and T\ are specified 
(see Section 4.1.1). 



relation between the internal density of the cloud, n, and 
the cloud size we can infer a column-density distribution. 
Adopting n ~ r p c with — 1.2 < p < (Scalo & Lazarian 
1996 and refs. therein) we find 5 that the column-density 
distribution should follow a power law fjy H ~ -^h^ w hh 
P > 2.5. If this column-density spectrum applies to DLAs, 
we would expect a decline of column densities faster than 
that predicted by the exponential profile model. 

Both the exponential profile model and the cloud mass- 
spectrum hypothesis suggest that the true distribution 
may be approximated with a power law. The hydrody- 
namic computations of DLAs by Cen et al. (2003) yield 
a simulated true distribution of Nn consistent with this 
conclusion. 

In light of these considerations, we adopt the simple 
law /jv h ~ for the true distribution, with the param- 
eter (3 to be determined by the method itself. If this is a 
good approximation, the resulting biased distribution 
must reproduce the observed fast decline of the number of 
DLAs with column density, which is commonly fitted by 
a Schechter-type function 

)-%-<*), m 

characterized by a fast decline at A% > A* (Pei & Fall 
1995; Storrie-Lombardi & Wolfe 2000; Peroux et al. 2003). 

P+3 

5 With the adopted relation we find m c ~ Ajf^ 1 and then 
derive the lower limit of /3 assuming p — 0. 
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Table 1. Results. 



Ext. curve 


Band 





a 




$22 (m^)" 
19.0/19.5 


$23(mf) b 
19.0/19.5 




f^22 

(io- 3 ) 


^23 

(IO" 3 ) 


MW 


r' 


1.56 


— 0.44 


-0.13 


0.41/0.34 


0.44/0.38 


-0.37 


2.3 


7.1 


MW 


9' 


1.51 


— 0.40 


-0.10 


0.48/0.42 


0.51/0.46 


-0.31 


2.8 


9.5 


SMC 


r' 


1.61 


-0.46 


-0.19 


0.33/0.27 


0.37/0.30 


-0.44 


1.9 


5.5 


SMC 

CL T T _ 1- ■ 


g' 


1.48 


-0.38 


-0.07 


0.52/0.46 


0.56/0.51 


-0.27 


3.1 


11.3 



5 Upper limit of integration in Eg. lUVl or 1I2II truncated at logTVuT = 22. 



b Upper limit of integration in Eq. I|17|l or 12 U truncated at logA^Hi = 23. 



5.3. The shape of the true metallicity distribution 

The arguments leading to the choice of fz can be sum- 
marized as follows. The observed distribution of [Zn/H] 
measurements peaks at [Zn/H] ~ — 1.1 dex and does not 
show systems below w — 2 dex or above solar metallicity 
(Pettini et al. 1997). Although this distribution is likely 
biased at both ends, we do expect a genuine decline in 
the number of DLAs below and above some critical val- 
ues of metallicity. At [Zn/H] < — 1.5 dex the zinc sample 
shows a decrease of the number of DLAs in a region of 
the (Nzn, Ah) plane not affected by any bias (the trian- 
gle DEF in Fig. I3J). At high metallicity it is reasonable to 
predict a decrease since we expect a natural decline of the 
number of systems with higher and higher star formation 
rates. These arguments indicate that the true distribution 
of [Zn/H] starts from a negligible value at low metallic- 
ity, shows a rise around w —1.5 dex and declines after 
reaching a maximum. We have been particularly careful 
in modelling this trend with an analytical function be- 
cause the results, and in particular the mean metallicity, 
depend on the adopted functional form. To choose the 
function we started by making an educated guess of the 
shape of the metallicity distribution based on our knowl- 
edge of the statistics of galaxies and DLAs. We then made 
sure that the adopted function satisfies two requirements: 
(1) that it is a "conservative" one, i.e. a function that does 
not overpredict the effects of the obscuration; (2) that the 
predicted extension of the function at high metallicities, 
where the obscuration is important, is not affected by the 
poor knowledge of the low end of the distribution. After 
considering several possibilities, we adopted the function 

fz cx (Z/Z*) a e- z / z ' , (20) 

which satisfies very well the above requirements, as we 
explain below. This function is similar to the one proposed 
by Schechter (1976) for describing the luminosity function 
of galaxies. 

To make an educated guess for the shape of the distri- 
bution we started from the metallicity-luminosity relation 
in galaxies. Evidence is building up that this relation is 
valid not only in the local Universe (e.g. Lamareille et al. 
2004), but also in DLA systems (Ledoux et al. 2005). The 
empirical relation between the logarithmic metallicity and 
the absolute magnitude implies that the linear metallicity 
Z increases linearly with the luminosity. The luminosity 



in turn follows a Schechter distribution both in the lo- 
cal Universe (e.g. Cuesta-Bolao & Serna 2003), at redshift 
z ~ 0.1 (e.g. Blanton et al. 2003) and, apparently, up to 
z ~ 3.5 (Poli et al. 2003). Combining these different pieces 
of evidence, it is reasonable to assume that Z may follow 
a Schechter distribution in DLA systems. 

The requirement (1) of a "conservative" function is 
equivalent to make sure that decline at high metallicity 
is fast. In fact, the faster the decline of the "true" distri- 
bution, the lower the estimated number of missed DLAs. 
Luckily, the Schechter satisfies this requirement since it 
provides a very fast decline at Z>Z„. In fact, the decline 
is faster than in any polynomial in Z and in a lognormal 
distribution. 

The requirement (2) implies that we should avoid func- 
tions for which a single parameter specifies the behaviour 
at both ends of the distribution. The Schechter function 
satisfies this requirement since the parameter a, which in 
practice controls the extension of the distribution at low 
metallicities, has a negligible effect on the exponential de- 
crease at Z > Z*. For a lognormal distribution, instead, 
the width parameter is affected by the fit of the distri- 
bution at the low-metallicity end, which is poorly con- 
strained by the observations. The resulting error on the 
width parameter will affect the extension of the distribu- 
tion at high metallicities and therefore the estimate of the 
obscuration effect. The same considerations apply to any 
distributions that, like the lognormal one, are specified 
by a centroid and a width. For these types of functions, 
the uncertain behaviour of the low end of the metallicity 
distribution affects the determination of the obscuration 
effect. 

6. Results and discussion 

In Table 1 we summarize the results obtained from the 
application of our procedure. The quasar extinctions were 
estimated in the photometric bands r' and g' for an ab- 
sorption redshift z — 2.3. The differences between the re- 
sults obtained for the two bands give an estimate of the 
uncertainty due to the lack of a homogeneous set of visual 
magnitudes for all the quasars of the adopted surveys. The 
obscuration fraction in the r' band is more conservative 
than that in the g' band, given the increase of the extinc- 
tion with decreasing effective wavelength. The results for 
the g' band (A = 0.44^m) are more comparable to those 
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derived in previous studies of the obscuration bias, which 
considered the B band (A = 0.44/im; Ellison et al. 2001). 

Two types of dust extinction curves were considered, 
namely the MW curve by Cardelli et al. (1988) and the 
SMC curve by Gordon et al. (2003). The presence or ab- 
sence of the 2175A extinction bump in DLAs is irrelevant 
for most of our results, because at z = 2.3 the extinction 
curves in the r' and g' bands do not sample the bump 
(Fig. I2J). The only marginal exception is the combination 
of the r' band with the MW-type extinction. If the bump 
is completely absent in DLAs, the MW results become 
closer to the SMC results for the r' band. 

For each combination of extinction type and photo- 
metric band we derived the best-fit values of the param- 
eters via x 2 minimization in the intervals 1.0 < (3 < 2.0, 
-1.0 < a < 0.0, and -1.0 < log(Z*/Z Q ) < +0.5. The 
best-fit parameters are listed in Table 1. We made sure 
the minimum is unique by careful inspection of the vari- 
ation of x 2 hi the specified parameter space. The typical 
fit errors of /3, a and log(Z»/%) are ~ ±0.12, ~ ±0.25 
and ~ ±0.15 dex, respectively. 

In Fig. El we show the true and biased distributions 
(dashed and solid lines) of the best-fit solution for the 
case of MW-type extinction and g' band. The empirical 
distributions (circles with error bars) are well reproduced 
by the computed biased distributions. Similar results are 
found for all the cases considered in Table 1. 

The effect of DLA extinction on the shape of distri- 
bution function of quasar apparent magnitudes can be 
seen in Fig. |3J where we show the adopted distribution 
n m = n(m; z) (dashed line) and the predicted biased dis- 
tribution nj^ = n h (m;z) (solid line) for the g' band and 
MW-type extinction. The agreement between n^, and the 
empirical distribution (diamonds) demonstrates the capa- 
bility of the procedure to account for the bias in a self- 
consistent way. The results are very stable for variations 
of ±0.1 of the parameter T\ around the adopted value 
T\ = 0.5. Fig. shows that the obscuration effect does 
not change substantially the shape of the distribution of 
apparent magnitude of the quasars, at least within the cur- 
rent values of limiting magnitude of the surveys. This is 
due to the fact that a large fraction of DLAs has relatively 
low H 1 column density, as shown in Fig. and, as a con- 
sequence, low extinction. The modest bias of the quasar 
statistics does not imply that the bias is unimportant for 
the DLAs statistics, as we discuss below. 

Fig. HJi is reminiscent of Fig. 8 by FP93 where these 
authors compare the model and empirical distributions of 
Ah and k. In fact, the dust-to-gas ratios k is defined by 
FP93 as an extinction per unit H 1 column density and is 
therefore almost equivalent to a metallicity, as one can see 
in our Eq. (|llf> . The fact that the metallicity distribution 
is much better constrained by the observations than the k 
distribution is an advantage of our approach. In FP93 the 
model distribution of Ah is derived using an exponential 
H 1 radial profile; in our work the model is meant to be the 
simplest analytical approximation of the true distribution 
in the range probed by the observations. 



6.1. The Hi column density distribution 

From the analysis of the H 1 column density distribution 
we derive two main results. First, the typical value of (3 
that we find, (3 ~ 1.5, is remarkably similar to that mea- 
sured in quasar absorbers of lower column densities, not 
affected by obscuration bias (Tytler 1987; Petitjean et al. 
1993; Storrie-Lombardi & Wolfe 2000). Second, the biased 
distribution /jj- derived from a simple power law success- 
fully reproduces the shape of the empirical distribution. 
Both results indicate that the fast decline of the number 
of absorbers observed in the DLA regime is due to the 
obscuration effect, the true distribution of column density 
being instead consistent with a simple power law, as for 
the majority of quasar absorbers. Quite interestingly, the 
typical values of (3 that we derive are intermediate between 
the predictions of the exponential profile model of FP93, 
[3 ~ 1 , and the values expected from the mass-spectrum of 
interstellar clouds, (3 J; 2.5 (see Section 5.2.2). This seems 
to indicate that both effects should be taken into account 
in the full theoretical treatment of the true distribution 
/jv h - The hydrodynamic simulations of DLAs by Cen et 
al. (2003) yield a slope slightly below 2, broadly consistent 
with our results. 

The approximation of the true distribution with a sim- 
ple power law should not be extrapolated in the range 
of column densities not sampled by the observations 
(log Ah 1 > 22). An intrinsic faster decline in that range 
must occur in order to avoid an infinite value of J^dla 
(see Section 6.5). A faster decline is expected in the ex- 
ponential profile model of disk gas distribution when Ahi 
approaches the central value Aj_o (FP93). Also the onset 
of physical mechanisms specific of very high density en- 
vironments may induce a sharp drop of the distribution 
at very values of Ahi (Schaye 2001). The present results 
suggest that the genuine fast decline of the Ah i distribu- 
tion may lie beyond the range probed by the observations. 
In principle, one could model this intrinsic decline with a 
Schechter function, but the position of the "knee" would 
be unconstrained. In the following we introduce an upper 
cutoff in the column density distribution when we need to 
estimate quantities affected by the high end of the Ahi 
distribution. 

6.2. The metallicity distribution 

The results on the metallicity distribution depend on the 
adopted model for the shape of the distribution. As dis- 
cussed in Section 5.3, we believe that the results obtained 
from a Schechter function are more conservative and reli- 
able than those derived from other functions (polynomi- 
als, lognormal or other functions parametrized in terms 
of "centroid" and "width"). In addition to the considera- 
tions given in Section 5.3 we note here that the position 
of the "knee" is sufficiently well constrained by the 
observational data (Fig.^Ja). 

The main result of the analysis of the metallicity dis- 
tribution is that a non-negligible fraction of DLAs exist 
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Fig. 6. Frequency distributions of of Hi column densities (left panel) and metallicities (right panel) in DLAs. Dashed 
lines: true distributions. Solid lines: distributions biased by quasar obscuration. Circles: empirical distributions obtained 
from DLA surveys. By imposing that the biased distributions best fit the observed ones, we infer the true distributions 
(see Section 4.4). The solution shown in the figure was derived for DLAs with a MW-type extinction and quasars 
observed in the g' band. 



with near solar metallicity. In previous work the metal- 
licity distribution of DLAs has been compared with that 
of different stellar populations of the Milky Way to cast 
light on the nature of DLA galaxies. Pettini et al. (1997) 
found that the [Zn/H] distribution observed in DLAs does 
not resemble that of any Milky- Way population. The true 
metallicity distribution of DLAs inferred in the present 
work changes significantly these conclusions. In Fig. we 
compare the DLA distributions obtained for the best-fit 
solutions listed in Table 1 with the stellar distributions 
published by Wyse & Gilmore (1995) for the thick and 
thin disk of the Milky Way (the same template used by 
Pettini et al. 1997). One can see that the true DLA distri- 
bution envelops those of both disk populations. This result 
is consistent with the paradigm that DLA galaxies are pro- 
genitors of present-day disk galaxies, even though there is 
plenty of room in the distribution for contributions from 
metal-poor galaxies. We cannot derive more information 
from these results because the exact shape of the true dis- 
tribution is uncertain, particularly in the tails, owing to 
the poor statistics of the surveys. In any case, the present 
results bring fresh support to the most commonly accepted 
paradigm on the nature of DLA galaxies, suggesting that 
most disks may already be in place at z ~ 2.3, consistent 
with what happened in the Milky Way. 



6.3. The total obscuration fraction 



In Table 1 we list the total obscuration $22 computed 
adopting Ahi = 10 22 atoms cm -2 as upper limit of inte- 



gration 6 in Eq. \Yty . This is equivalent to truncate the H 1 
distribution in the range of column densities not probed 
by the observations. The true obscuration factor might be 
higher than $22 if DLA systems exist also beyond such 
limit. An estimate of this uncertainty can be appreciated 
in Table 1, where we also give $23, obtained integrating 
the distributions up to Am = 10 23 atoms cm -2 . 

The total obscuration fraction lies in the range 
$22 ~ 0.33/0.52 for the magnitude limit typical of high- 
resolution surveys (m^ ~ 19.0). For spectroscopic sur- 
veys of moderate resolution, with me. ~ 19.5, we obtain 
$22 ~ 0.27/0.46. These figures are underestimated by only 
~ 3/5% if DLAs exist with the same power law, up to 
Ah 1 = 10 23 atoms cm -2 . 

The fraction of missing systems that we find is con- 
sistent with the range 0.23 to 0.38 estimated by PF95 at 
z ~ 3. 

Our estimate of total obscuration fraction can be com- 
pared with that obtained by Ellison et al. (2001) from the 
analysis of an unbiased sample of radio-selected quasars. 
By comparing their number density of DLAs, n(z), with 
that obtained by Storrie-Lombardi & Wolfe (2000) from 
an optically-selected (biased) sample, Ellison et al. con- 
clude that the unbiased n(z) is about ~ 50% larger than 
the biased one. Since the Storrie-Lombardi & Wolfe survey 
has a magnitude limit typical of moderate resolution sur- 
veys, this result should be compared with our obscuration 
fractions derived for nig = 19.5, which indicate that the 



The highest measured column density in a DLA system is 
currently iV H i = 10 2185 atoms cm" 2 (Prochaska et al. 2003). 
Column densities up to 10 22 atoms cm" 2 are found in the Milky 
Way ISM (Dickey & Lockman 1990). 
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Fig. 7. Distribution functions of the logarithmic metallic- 
ity normalized to unity. Curves: true distribution of DLAs 
at z ~ 2.3 inferred with our method. Dashed (dotted) 
lines: MW-type (SMC-type) extinction; red (green): re- 
sults for the r' (</) band. Histograms: Milky- Way stars 
belonging to the thick disk (shaded area) and thin disk 
(thick line) populations. 

true number is between ~ 40% and ~ 90% larger than the 
biased one. The general agreement with the result found 
by Ellison et al. represents an important test of validity 
of our procedure. 

6.4. Number density versus limiting magnitude 

One prediction of the obscuration bias is that the fraction 
of missed DLAs must decrease with increasing limiting 
magnitude of the survey. The variation predicted by our 
computations is shown in Fig. [SJl for the different cases 
considered in Table 1. The obscured fraction decreases, 
but does not vanish at least up to me — 21.5, where 
$22 ~ 0.1/0.2. This variation of <!> versus me offers an 
important observational test based on the measurement 
of the number density of DLAs, n(z), at increasing val- 
ues of mi in an unbiased sample. Ellison et al. did find 
an increase of n{z) with mi which supports the existence 
of the bias. The number density increases in the range 
19 SS mt 20, but flattens at mi ^ 20, while we predict 
that it should keep increasing at least up to mi <j 21.5. 
However, taking into account the experimental error bars 
there is no real discrepancy. To demonstrate this, we plot 



than a plateau. More stringent measurements are crucial 
for clarifyng this issue. It is clear, in any case, that the 
combination of our treatment of the bias with studies of 
unbiased samples offers a powerful tool for a quantitative 
estimate of the obscuration effect. 

6.5. The gas content of DLAs 

By using our best-fit A (Hi) distribution functions we can 
compute the total contribution of the gas in DLAs to the 
critical density of the Universe, 



DLA 



HqI-1 m H 
C/Ocrit 



A H fN H dN H 



(21) 



If /jv h is a power law with (3 < 2, as in our case, the in- 
tegral diverges and we must introduce an upper cutoff. In 
Table 1 we give the values O22 and f223 obtained integrat- 
ing up to Ah = 10 22 and 10 23 atoms cm -2 , respectively. 
The value ^22 is conservative since the true distribution 
Jn-h may extend above Ah ~ 10 22 cm -2 . The values that 
we find for different input parameters lie in a relatively 
narrow range, namely 1.9 x 10~ 3 < ^22 < 3.1 x 10~ 3 . 

To estimate the missed fraction of H 1 mass, we com- 
pare these values of Sl 2 2 with — 0-9 x 10~ 3 , the 
biased value obtained integrating our H 1 sample with the 
commonly adopted Schechter function (flfJf) . Since ^22 is 
a conservative estimate, this comparison indicates that at 
z ~ 2.3 the true comoving mass of DLAs is at least a fac- 
tor of 2 higher than that obtained by magnitude-limited 
surveys. 

The values of ^22 that we derive agree well with the 
value f^Eoi — 2.6 x 10~ 3 measured by Ellison et al. (2001) 
from the analysis of their "dust-free" sample. In addition, 
they agree with the value ^dla — 3 x 10~ 3 derived at 
z ~ 2.3 by Cen et al. (2003) from their hydrodynamic 
simulations. 

6.6. The metal content of DLAs 

The column-density weighted metallicity of DLA systems 
is used to measure the degree of metal enrichment of 
the population of DLAs as a whole (Pettini et al. 1999) 
and to estimate the mean cosmic metallicity of the high- 
redshift Universe (Pei & Fall 1995; Cen et al. 2003). The 
expression commonly used in literature to measure the 
weighted metallicity from a finite set of column densities, 
< (Zn/H) DLA >= (E» ^Zn,i)/(Ei can be put in the 

form 



in Fig.lHb n(z) 



(z) (1 — $) for an assumed total density < (Zn/H)DLA > : 



n*(z) = 0.4. One can see that these predictions agree with 
the measurements of n(z) performed by Ellison et al. If 
n*(z) = 0.4, the true number of systems would be ~ 90% 
larger than the biased n{z) measured by Storrie-Lombardi 
& Wolfe. This figure of obscuration is still consistent with 
our estimate. The only "disagreement" with Ellison et al. 
would be on the interpretation of the n{z) versus mi plot, 
for which we claim that there is a steady increase rather 



J ^Zn /JV Z . dN ^ 

/ A H /jvh dN H 



(22) 



to measure the same quantity from the frequency distribu- 
tion functions /jv z „ an d /at h normalized to unit area. If the 
metallicity Z and the H 1 column density are independent 
variables, as we assume in our mathematical formulation, 
then it is easy to show that < (Zn/H)oLA > equals the 
expectation value < Z>= J Z fz d Z estimated from the 
distribution fz normalized to unit area. 
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Fig. 8. (a) Fraction of obscured DLAs and (b) number density of DLAs predicted as a function of the limiting mag- 
nitude for the best-fit solutions of Table 1. Dashed and dotted lines: MW-type and SMC-type extinction, respectively. 
Red and green: results for the r' and g' bands. Stars with error bars: CORALS results from Ellison et al. (2001). 



In Table 1 we list the mean metallicity, normalized in 
the usual form log(< Z > /Zq), obtained for a Schechter 
metallicity distribution. The mean metallicity lies between 
—0.44 and —0.27 dex below the solar level for the different 
dust models and photometric bands considered in Table 
1. Considering the total error of ~ —0.3 dex in our es- 
timate (fit errors plus statistical errors of the empirical 
distributions) we conclude that the mean metallicity is 
> — 0.7 dex at 1 a level. These values are higher than 
the direct measurements of weighted metallicity of DLAs, 
which give ~ — 1.1 dex at the typical redshift of our sam- 
ple (Pettini et al. 1999; Prochaska et al. 2003). This large 
difference is probably be due to the fact that the weighted 
metallicity is extremely sensitive to the number of high 
column-density DLAs included in the average, which are 
the DLAs more affected by obscuration. Our results are 
marginally consistent with the mean weighted metallicity 
[<(Zn/H) DLA >] = -0.88 ± 0.21 dex obtained from the 
CORALS metallicity survey (Akerman et al. 2005). 

Quite interestingly, the high mean metallicity that we 
derive may help to solve the "missing metal" problem 
pointed out by Wolfe et al. (2003) in their work on the 
Star Formation Rates (SFRs) in DLAs: the mass of met- 
als produced by z ~ 2.5, inferred from their SFRs, is 30 
times larger than detected in absorption in DLAs. 

6.7. Observational thresholds due to obscuration bias 

Our method allows us to estimate, for the first time, the 
fraction of obscured DLAs as a function of H I column den- 
sity and metallicity Z , and also as a function of the total 
zinc column density. These obscuration fractions (f>g(Nn), 
4>e(Z) and 4>e(N^ n ), shown in Fig. should only be in- 
terpreted in a statistical sense. However, they provides us 
a powerful indication of how the statistical distributions 
of DLAs are distorted by the obscuration bias. In partic- 



ular, they provide a means of quantifying the existence of 
observational cutoffs induced by the bias itself. 

The analysis of cf>g(N^ n ), shown in panel (c), gives a 
quantitative explanation for the lack of DLAs systems 
above the obscuration threshold in Znll column density 
originally proposed by Boisse et al. (1998). We estimate 
that over 60% of DLAs are missed at N% n = 10 13 2 atoms 
cm -2 , and this fraction rises rapidly to ~ 95% if the N% n 
increases by ~ 0.5 dex. The threshold is rather indepen- 
dent of the adopted input parameters. Only for peculiar 
extinction curves without UV rise (gray extinction) we 
may expect that the threshold can be crossed. 

We can use our results to estimate how many DLAs 
should be observed in an unbiased survey in order to de- 
tect cases with high metal column density. From the typ- 
ical distribution //v Zn that we infer, we estimate that the 
number of DLAs in the range between the detection limit 
and the Boisse's threshold (11.5 < logTV(Znii) < 13.1) is 
about 24 times larger than the number of systems in a 
range of similar extension above the threshold (13.1 < 
logiV(Znii) < 14.7). This means that one is not guaran- 
teed to detect one DLA above the threshold even with 
~ 30 unbiased spectroscopic observations. This is cur- 
rently an observational challenge because the high extinc- 
tion of the DLAs above the threshold makes very hard to 
perform high-resolution spectroscopy, even for the quasars 
which lie at the bright end of the true distribution of ap- 
parent magnitudes. 

A natural consequence of the cutoff at high metal 
column densities, in conjunction with the artificial DLA 
threshold iVm > 10 20 3 atoms cm -2 , is the existence of an 
observational cutoff at high metallicities. From panel (b) 
one can see that about 90% of systems are missed at solar 
metallicity and it would be practically impossible to de- 
tect systems with oversolar metallicity, if they exist. Also 
worth of mention is the fast rise of the obscuration with 
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Fig. 9. Fraction of obscured DLAs as a function of (a) Hi column density, (b) metallicity Z, and (c) Znii column 
density for the best-fit solutions of Table 1. Dashed and dotted lines: caption as in Fig.|Hl 



increasing metallicity, which distorts dramatically our per- 
ception of the metallicity distribution of DLA systems. 
The fact the different curves plotted in panel (b) lie close 
to each other indicates that these results are rather inde- 
pendent of the parameters adopted. 

Also the distribution of H I column densities is severely 
distorted by the extinction bias, as shown in panel (a). 
The curves of <fi(Nn) obtained for different parameters are 
not tightly close to each other, but the general behaviour 
is quite similar in all cases. The obscuration starts from a 
low value in the sub-DLA regime and rises steadily over 
the DLA regime, approaching ~ 80% at TVhi ~ 10 22 atoms 
cm~ 2 . This is the approximate value of the observational 
threshold for detection of high column density DLAs. The 
rise of 4>(Nn), in conjunction with the genuine decrease of 
the true distribution , provides a natural explanation 
for the non-detection of DLAs at AThi 10 22 atoms cm~ 2 . 
This makes it unnecessary to invoke the sudden transition 
from atomic to molecular hydrogen proposed by Schaye 
(2001) in order to explain this empirical cutoff. In any 
case, if fully molecular clouds exist as predicted by Schaye, 
they would be characterized by a high column density of 
dust grains. The fact that quasar absorbers with N(H.2) > 
AT (Hi) have not yet been detected may indicate that also 
this type of absorber is missed due to obscuration. 

7. Summary and conclusions 

Starting from the theoretical relation between the inter- 
stellar extinction, A\ , and the column density of a volatile 
metal, Nx v , we investigated the detailed relation between 
A\ and Az n in DLAs. We derived in this way the Eq. 
1)11(1 . which gives the quasar extinction in the observer's 
frame as a function of the DLA hydrogen column density, 
Ah, metallicity, Z = Nz n /Nn, fraction in dust of iron, 
/fc and a factor G, which describes the dust grain prop- 
erties. For the first time, the metallicity evolution of the 
dust-to-metal ratio in DLAs is explicitly accounted for in 
this type of relation, using an expression /p c = fp e (Z) ob- 
tained from a previous study of depletions (Vladilo 2004) . 



We argue that the factor G may not vary substantially in 
interstellar environments with moderately low metallicity 
(Z £ 0.1 Zq) and derive a value of 60% the Milky- Way 
value for SMC- type dust. Possible variations of G at the 
early stages of chemical evolution do not affect the extinc- 
tion because fp c (Z) vanishes when Z <c Zq. 

We searched for empirical evidence of the rise of the ex- 
tinction with metal column density in DLAs. We did find 
an increase of the quasar magnitude with A^zn, consistent 
with this expectation (Fig.0J). 

Starting from Eq. (|ll|l , we derived a mathematical for- 
mulation aimed at estimating how the frequency distribu- 
tions of H I column densities and metallicities of a sample 
of DLAs are biased as a result of the obscuration gener- 
ated by the DLAs of the same sample. We ignore the ob- 
scuration due to low-redshift DLAs along the same line of 
sight, showing that this contribution is relatively low. At 
variance with the formulation of PF95, where a constant 
dust-to-metal ratio was adopted for all DLAs, we adopted 
a metal-dependent dust-to-metal ratio fF e (Z). We cross- 
checked the validity of our formulation making use of the 
equations derived by FP93 for the case of a power law 
quasar luminosity function and constant dust-to-gas ratio 
(Section A. 1.2). 

We presented a practical procedure for recovering the 
unbiased distributions of column densities and metallic- 
ities, Jn h and fz, using the empirical distributions ob- 
tained from magnitude-limited surveys. The unbiased dis- 
tributions are modelled with simple analytical expressions 
in parametric form. The bias induced by the extinction on 
the quasar magnitude distribution is accounted for self- 
consistently by the procedure. A unique characteristic of 
the method is the possibility of recovering the true metal- 
licity distribution starting from an educated guess of the 
functional form of the distribution. We have shown that 
using a Schechter function for the metallicity distribution 
appears to provide more conservative and reliable results 
than other functions (e.g. lognormal or polynomials) for 
the estimate of the bias (Section 5.3). 
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We applied our method to the sample of DLAs in 
the redshift interval 1.8 < z < 3, with mean redshift 
<z>~ 2.3, where the bulk of spectra are available. The 
extinctions were computed in the g' and r' photometric 
bands of the SDSS, considering both a MW-type and an 
SMC-type average extinction curve. We found that the 
effect of the obscuration on the quasar magnitude distri- 
bution is modest (e.g. Fig. 5). On the other hand, the bias 
plays an important role in shaping the statistical distribu- 
tions of DLAs. 

The unbiased distribution function of H I column den- 
sities is successfully approximated by a power law /jv h 
N-^P , with (3 ~ 1.5. This simple law, in conjunction with 
the bias effect, is able to fit the observed decline of the 
number DLAs with Ah up to N(H i) ~ 10 218 atoms cm -2 . 
The faster drop of the distribution expected at very high 
column densities (FP93; Schaye 2001) probably lies out- 
side the range currently probed by the observations. The 
slope (3 ~ 1.5 is remarkably similar to the value which 
characterizes most of quasars absorbers with lower col- 
umn densities, for which the extinction bias is negligible. 
The value of the slope suggests that the mass spectrum of 
interstellar clouds may play an important role in shaping 
the distribution. 

The observed distribution of metallicities is reproduced 
modelling the unbiased distribution with a Schechter func- 
tion f z oc {Z/Z*) a e - z / z % with a ~ -0.4 and 
log(Z*/ZQ) ~ —0.15 dex. Once converted into logarithmic 
scale, this function peaks at [Zn/H] ~ —0.3/ — 0.4 dex, in- 
dicating that a non- negligible fraction of DLAs exist with 
near solar metallicity. The true metallicity distribution of 
DLAs envelops the distributions of the thick-disk and thin- 
disk Milky- Way stellar populations. This result is in line 
with the paradigm that DLA galaxies are progenitors of 
present-day disk galaxies. However, in the inferred distri- 
bution there is also room for a relevant contribution from 
metal-poor galaxies. 

The mean weighted metallicity that we derive , ~ 
—0.4/— 0.3 dex, is significantly higher than the column- 
density weighted metallicity <[Zn/H]>= —1.1 dex mea- 
sured at z ~ 2.3. This high value of mean metallicity may 
help to solve the discrepancy between the metallicity ob- 
served in DLAs and that predicted on the basis of their 
SFRs (Wolfe et al. 2003). 

The fraction by number of obscured DLAs, <£>, de- 
creases with increasing limiting magnitude of the survey, 
mi. We obtain $ 0.4/0.6 at mg ^ 18.5, a limit repre- 
sentative of high resolution spectroscopic surveys, carried 
out with 4-m class telescopes. The corresponding figures 
for high-resolution surveys in 10-m class telescopes are 
$ ~ 0.35/0.55 {m t ~ 19.0). 

An important result of our work is the explanation 
(Fig. Et) of the observational limit Nz n ~ 10 132 atoms 
cm~ 2 (Boisse et al. 1998) in terms of very simple physics 
(Section 2), with no tuning of the local dust parame- 
ters. Alternative explanations would require an additional 
mechanism able to produce the exact same limit predicted 



by the extinction. The simplicity of the extinction model 
favours the existence of the obscuration. The existence of 
the threshold Nz n ~ 10 132 atoms cm~ 2 is fundamental for 
reconciling the predictions of galactic models (Prantzos & 
Boissier 2000, Hou et al. 2001; Churches et al. 2004) and 
cosmological simulations (Cen et al. 2003, Nagamine et 
al. 2004) with the observations of DLAs. Only for gray 
extinction curves, without UV rise, we may expect that 
the threshold can be crossed. 

Our results on the magnitude of the obscuration effect 
are broadly consistent with those presented by FP93 and 
PF95, with some important differences. The modest bias 
of quasar statistiscs that we find is at variance with the 
predictions of FP93, which allowed up to 70% of quasars 
to be obscured. Contrary to FP93, we do not expect a sig- 
nificant contribution of DLA extinction at redshift z > 3 
in spite of the rise of A \ with z. In fact, our model of dust 
fraction fp c {Z) vanishes at very low metallicity so that 
the extinction (jl 1|> vanishes at z > 3, when the metallic- 
ity Z < O.OIZq. 

A crucial test of our results is the comparison with 
studies of radio-selected samples of quasars/DLAs. Our 
determinations are consistent with the estimates of ob- 
scuration fraction, number density and ^dla obtained 
from the CORALS survey (Ellison et al. 2001). Our pre- 
diction of mean metallicity is marginally consistent with 
the recent estimate of mean weighted metallicity of the 
CORALS sample (Akerman et al. 2005). 

Accurate estimates of the obscuration effect will be 
possible as soon as the samples with metallicity measure- 
ments become sufficiently large for precise statistical anal- 
ysis. 
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Appendix A: Mathematical formulation 

A.l. True and biased distributions 

To derive our mathematical formulation we define the fre- 
quency distributions of DLAs and quasars most appropri- 
ate for taking into account the causes and effects of the 
extinction bias. To account for the DLA extinction via Eq. 
(|ll|l we are interested in the number of DLAs per unit hy- 
drogen column density and metallicity. To account for the 
dimming of the quasar we are interested in the number 
of quasars per unit of apparent magnitude. The apparent 
magnitude, m, can be directly compared with the limiting 
magnitude of the observational survey, m^. 

We call /i(Ah, Z, z) g?Ah dZ dz du> the number of neu- 
tral hydrogen layers with column density Ah € (ATjj, Ah + 
cOVh) and metallicity Z 6 (Z, Z+d Z) located in the spher- 
ical shell of redshift z 6 (z, z + dz) in the infinitesimal solid 
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angle du> in a random direction (we assume that the gas 
layers are distributed isotropically). 

We call q{m,z e )dmdz e the true number of quasars 
with apparent magnitude to G (to, to + dm) in the shell 
of emission redshift z e G (z e , z e + dz e ) all over the sky 
(we assume an isotropic distribution of quasars). We de- 
fine n(m; z, z™ ax ) = J e q{m, z e ) dz e , where z is the ab- 
sorption redshift of a foreground Hi layer and z™ ax the 
maximum redshift of the quasars in the survey. For sim- 
plicity of notation we omit hereafter the dependence on 
z°™. Therefore n(m; z) dm is the total number of quasars 
with magnitude m G (to, to + dm) observable beyond red- 
shift z all over the sky. This is the true number in absence 
of quasar obscuration. 

We call 5oj q (z) the average solid angle subtended in the 
observer's optical band by quasars in the redshift interval 
z < z e < z™ ax . 

With the above definitions, the number of DLAs with 
Nh S (Ah,Ah + dN H ), Z G (Z,Z + dZ) and redshift 
z G (z, z + dz) in front of one single background quasar 
with to < mg, is 

/i(7Vh, Z, z) dNn dZ dz 5(v q (z) 

provided 7V H = iV(Hi) > 10 20 3 atoms cm -2 . The num- 
ber of DLAs in the same differential element d.Nn dZ dz 
that are detectable in front of the n(m; z) dm background 
quasars with to G (to, to + dm) is 

h(Nn, Z, z) dNndZ dz Suj q (z) x n(m;z)dm, 

provided to < me. By integrating in dm up to the limiting 
magnitude me we calculate the total number of DLAs that 
are detectable in the survey. We first consider the ideal 
case in which no quasar extinction is present. In this case 
the unbiased number of detectable DLAs in c?7Vh dZ dz is 



g mt (Ah, Z, z) dNn dZ dz 



h{N- a ,Z,z)5uj q (z) 



n(m; z) dm 



dN H dZdz. (A.i; 



We call g mi (Nn, Z, z) the true distribution of detectable 
DLAs in dNndZdz. 

We then consider the quasar extinction A\(Nb_, Z; z) 
due to the DLAs in cWh dZ dz (Eq. Illfl ignoring other 
sources of extinction. From this we define the bias function 



bra^Nu.Z^m, z) 



1 if to < me — A\(Nn 1 Z; z) 
if to > m,e - Ax(Nn,Z;z) 



The number of DLAs in the survey that are 
missed as a consequence of their own extinction is 
(g me — 9m e ) dNn dZ dz. The fraction of DLAs missed as 
a consequence of their own extinction is 

Vme (N H ,Z, z) = 9me - ^ = 1 - B mt (N H , Z, z) , (A A) 



where 

r 

B mi {N m Z,z) = ^ 



9 m £ 



7n i -A x (N n ,Z,z) 



n(m\ z) dm 



J o ' n(m; z) dm 



(A.5) 



The ratio of the integrals of the magnitude distribution 
in Eq. (|A.5(1 is reminiscent of the ratio of the integrals of 
the quasar luminosity function in Eq. (3) of FP93. This 
ratio represents, in a sense, the core relation from which 
the bias effect is estimated. 

The relation between the true distribution and the dis- 
tribution biased by its own DLAs is 



9 m , ( n h, Z, z) = B mt (A H , Z, z) g mi (A H , Z, z) 



(A.6) 



This relation is independent of the average solid angle 
Sujq(z). The normalization factor of n(m, z) is also irrele- 
vant because it is eliminated in Eq. (|A.5J| . 

To make practical use of Eq. I|A.6(I we need to un- 
derstand the relation between the biased distribution 
g me (N-R, Z, z) and the observed distribution of DLAs in 
dN^dZdz, that we call g^(Nn, Z, z). To derive an ex- 
pression for the observed distribution we must consider all 
the sources of obscuration, including the DLAs at z' < z 
that lie in the same lines of sight of the DLAs at redshift 
z. If these additional sources of obscuration are negligible, 
then the expression ljA.61) is a good approximation of the 
observed distribution (see A. 1.1), i.e. 



g°^(N H ,Z, z)~g° mi {N u ,Z,z). 



(A.7) 



(A 



The number of DLAs in dNn dZ dz that, in spite of their 
own extinction, are detectable in front of the n(m; z) dm 
background quasars is 

b me (Nti, Z, to, z) h(Nn, Z, z) dNn dZ dz Sujq(z) n(m; z) dm. 

By integrating in dm we obtain the total number of DLAs 
in dNftdZdz that, in spite of their own extinction, are 
detectable in the survey 



Using the above approximation, the treatment of the ob- 
scuration bias is self-contained, in the sense that given the 
true distributions of column densities and metallicities at 
a redshift z, the observed distributions of the same quan- 
tities are uniquely determined. In this way we can study 
the obscuration effect in a redshift interval sampled by the 
observations, even if we do not have good statistics of the 
DLAs at lower redshifts. 
„n To compare the distributions with the observations we 
integrate Eq. ljA.61) in dz over the interval (z m j n , z max ) 
with ~z = (z max — Zmin) /2 close to the peak of the observed 
distribution. If the statistical functions of interest show a 
smooth variation inside the interval we can approximate 
them with their value at z = z. We obtain 



g^ ne (A H , Z, z) dN H dZ dz = h(N n , Z, z) Suj q (z) x 



f h (N H ,Z) ~B mt (N H ,Z) f(N H ,Z) , 
where 

f h (N H ,Z) = f maX 5 b (A H ,^,z) dz 



(A. 



m e -A x (Nn,Z,z) 



n(m; z) dm 



dN K dZ dz . 



(A.3) 



.g(A^H, Z, z) dz 



(A.9) 
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and 



B me (N H ,Z) = B me (N H ,Z,z) 



(A.10) 



For simplicity we omit the dependence of these functions 
on z, z m ; n and z max - 

We now transform the distribution f(Nn,Z) into a 
distribution of column densities, /at H ; and a distribution 
of metallicities, fz, defined in such a way that fN H dNn is 
the true number of detectable DLAs with column densities 
between Ah and N-n + dNn in the interval (z m i n , z max ) and 
fz dZ the true number of detectable DLAs with metallic- 
ity between Z and Z + dZ in the same redshift interval. 
We call /]y H and f\ the corresponding biased distribution 
functions. 

To proceed further we assume that Ah and Z are 
independent variables 7 (this assumption is discussed in 
Section 4.4). This implies that )(N R ,Z)dN H dZ = 
/jv h dNn fz d Z. We substitute this expression in the right 
hand of Eq. (|A.8|I . We then integrate that equation in 
mctallicity and obtain 



In the term ip' we should not count the DLAs at redshift z 
that are missed due to their own extinction, since they are 
already counted in the term ip. We should instead count 
only the fraction (1 — ip) of DLAs at redshift z that are 
not missed due to their own extinction. Of this residual 
fraction, we must in turn take the fraction of DLAs at 
redshift z that have an additional DLA at z' < z in the 
same line of sight, F(z', z). Of the resulting fraction, we 
must finally consider the mean fraction of DLAs at redshift 
z' < z that are missed due to their own extinction, 7p(z'< 
z). From these considerations we obtain the relation 



ip' = (1 - ip) x F(z', z) x Tp(z'< z). 



(A.15) 



From Eqs. HA.4|) and IjAJjl we have g^ (A H , Z, z) 



gme(N}i,Z,z) x (1 — ip). Comparing with Eq. (|A.14(I 
one can see that the approximation of Eq. (|A.7(I . 



O obs 



(A H , Z, z) ~ (A H , Z, z), is valid when 



<p'<l-<p (A.16) 
By combining Eq. i|A.15|) with (|A. 16|> we obtain that 



f h {N H ,Z)dZ 



B me {N H ,Z)f z 



dZ 



Apart from a constant factor, the left term of this equation 
represents the biased distribution /jj- . The constant can 
be determined by imposing that in absence of bias the 
true and biased distributions must be equal. From this we 
derive the relation 



/jV H 



J Q 00 B me (N li ,Z)f z dZ 
Jo°°fzdZ 



fN H 



In a similar way, by integrating Eq. 
obtain the relation 



f™B me (N Il ,Z)f NH dNH 



where Adla = 10 20 3 atoms cm -2 



fz 



A. 1.1. Condition of validity of Eq. (|A.7|) 



f Na .{A.11) F ( Z '> Z ) x^ me (z'<z) « 1 (A.17) 

describes the limits of validity of the approximation (|A.7|) . 
We estimate the validity of this condition at z ~ 2.3, the 
mean absorption redshift of the surveys. 

From a recent summary (Ellison et al. 2004) of the 
unbiased number density of DLAs, tt-dla^), we calculate 
that the probability of intersecting 2 DLAs in a random 
line of sight up to z ~ 2.3 is 0.15. We take this value as 
an estimate of F(z', z). 

By comparing the number densities of biased and un- 
biased surveys there is room for no more than ss 50% 
of missed DLAs at z ~ 2.3 (Ellison et al. 2001, 2004). 
This represents an upper limit to Jp m (z' < 2.3) because 
the extinction in the observer's frame tends to decrease 
with decreasing redshift and therefore the missed fraction 
tpm e (z' < z) tends to diminish at lower redshifts. 

All together, we expect therefore F(z',z) x p me (z' < 
z) & 0.08 at z ~ 2.3 and lower values at lower redshifts. 
We conclude that the approximation (|A.7|) is good, even 
if it may slightly underestimate the obscuration fraction. 



(A.12) 



81 in dA H , we 



(A.13) 



We start by deriving the relation between the true 
number of detectable DLAs in absence of obscu- 
ration, g mf (A?H, Z, z) dNftdZ dz, and the number ob- 
served when all sources of obscuration are considered, 
g^(N H ,Z,z)dN }i dZdz. 

At redshift z the observed number equals the true num- 
ber minus the fraction of DLAs missed as a consequence of 
their own extinction, tp = p me (Nn, Z, z), minus the frac- 
tion of DLAs at redshift z missed due to the extinction of 
other DLAs at z' < z that lie in the same line of sight, 
V>' = V'm t (^h, Z, z). This implies 



<C(A H , Z, z) =g me (NH,Z,z)x(l-<p- ip') 



(A.14) 



7 A similar assumption was done by Fall & Pei (1993; 
Appendix) to separate the distribution of column densities 
from that of dust-to-gas ratios. 



A. 1.2. Test of Eqs. (|A~5|) . (|A~6jl and (|A~T2jl 

To test our formulation we cross-checked its predictions in 
a particular case considered by Fall & Pei (1993). Namely, 
we compared the predictions of our Eq. (|A. 6|) , calculated 
at constant metallicity Z, with those of Eq. (21) by FP93, 
which is calculated at a constant dust-to-gas ratio k. This 



test is equivalent to insert in Eq. I)A.12|) a Dirac delta 
distribution of metallicities. To make this comparison we 
transformed the power-law distribution of quasar lumi- 
nosities used by FP93 [their Eq. (8)] into a magnitude dis- 
tribution n(m; z) for our Eq. l|A~5)l . We adopted f3 = 2 and 
£( jX^) = as common values in both equations. In Eq. 
(2l; FP93) we adopted a value of k = 0.04 consistent with 
our choice of G = 1, Z/Zq = 0.1 and f Fc (Z) = 0.52 for 
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Fig. A.l. A test of validity of the equations presented 
in Section 4.1. Circles: predictions of our Eq. 1A.5I) . 
Diamonds: f (N, z)/ f t (N, z) predicted from Eq. (21) by 
Fall & Pei (1993). The adopted parameters are given in 
Appendix A. 1.2. 

this particular test. In Fig. IA.l1 we compare the resulting 
ratio f (N,z)/f t (N,z) predicted from Eq. (21) by FP93 
with the corresponding ratio (-/Vh, Z, z) /g me (-Wh, Z, z) 
predicted from our Eq. l|A.6Jl . which equals the term 
B mi {Nii, Z, z) of Eq. (|A.5(I . Our predictions (empty cir- 
cles) perfectly match those obtained with the equations 
derived by FP93 (filled diamonds). Our results are in- 
dependent of mi provided mi 3> 1 and mi 3> A\. For 
mi k 19 - 20 these conditions are well satisfied. 

A. 2. The quasar magnitude distribution 

We derive the relation between the "true" magnitude 
distribution of quasars, n(m;z), and the observed one, 
which is biased by the obscuration effect. We consider 
only the obscuration due to DLA systems. We start by 
defining q h (m, z e ) dmdz e the observed (biased) number 
of quasars with magnitude m G (to, to + dm) at redshift 
z e G (z e ,z e + dz e ). We then divide the quasars accord- 
ing to the number i of DLAs that lie in their direction. 
We call !Fi(z e ) the fraction of quasars at redshift z e that 
have i intervening DLAs. Depending on the value of i, 
the distribution of apparent magnitudes of the quasars 
will be different. We call 5^ (to, z e ) dm dz e the biased num- 
ber of quasars with i foreground DLAs, magnitude m G 
(to, to + dm) and redshift z e G (z e , z e + dz e ). We now as- 
sume that the fraction of quasars with two or more DLAs 
is negligible (see A. 1.1), i.e. that Ti>2{z e ) <C To{z e ) and 
Fi>2{z e ) -C J-\{z e ). From the condition ^i( z e) = 1> 

we obtain J-o(^e) — 1 — J~i{z e ). This gives 

q h (m, z e ) ~ [1 - J"i(z e )] q(m, z e )+Fi{z e ) q\{m, z e ) (A.18) 

where q(m, z e ) = <?§(to, z e ) because the true distribution, 
q(m,z e ), must equal the distribution of quasars without 
foreground DLAs, q^(m, z e ). 



We now integrate Eq. I|A.18(I in dz e over the interval 
(z, z™ ax ), where z is the absorption redshift and z™ ax the 
maximum redshift of the quasars in the survey. We assume 
!Fi(z e ) to vary smoothly in the interval, so that it can be 
approximated with its mean value T\ . This gives 



n°(m\ z) ~ 1 1 — T\\ n(m; z) + T\ n\(m; z) 



(A.19) 



where n h (m;z) = j*" q h {m, z e ) dz e and n\(m;z) = 
J*" q\(m,z e )dz e . 

We now derive an independent relation for n\(m;z). 
The apparent magnitude of the quasars with one DLA, 
to, equals the sum of the "true" magnitude in absence 
of extinction, to', plus the DLA extinction in the ob- 
server's frame, A\. The frequency distribution of the sum 
to = to' + A\ is given by the convolution of the frequency 
distributions of to' and Ax, since these are independent 
variables. This implies 

/>oo 

?7.k(m; z) = K I n(m';z) /a x (to — to'; z e ) dm' , (A. 20) 



where if is a normalization factor, /A A (^A;^e) is the fre- 
quency distribution of the extinctions of all the DLAs in 
the redshift range z G (0, z e ), and z e is the mean redshift 
of the quasars in the survey, z e G (z, z™ ax ). 

The fraction T\ is estimated from statistical surveys. 
From the statistics of quasars with z e ~ 3.0 one finds, in 
the redshift path observable from ground (2 > 1.8, z ~ 
2.3), that w 40% of the quasars have one single DLA (see 
Peroux et al. 2003). The true fraction T\ will be slightly 
higher than 0.4 owing to the limitations of the wavelength 
coverage of the spectra. We adopt here T\ ~ 0.5 ± 0.1. 

We assume that /a a (A\;~z e ) can be approximated with 
the distribution of extinctions in the redshift range where 
most DLAs are observed, z G (1.8,3.0). In this way the 
problem of estimating fA x {A\;~z e ) is brought back to 
that of estimating the distributions of column densities 
and metallicities in the same redshift range, /at h and fz- 
Assuming that Nn and Z are statistically independent 
variables, on the basis of Eq. (|llfl we obtain the extinc- 
tion distribution by convolving /jv H and fz- 

The normalization factor K is determined from the 
condition J n\(m; z) dm — J n(m;z)dm, which, from 
the integration in dm of Eq. (|A.19p . is equivalent to 
J n h (m;z)dm — J n(m; z) dm. These conditions guaran- 
tee that the total number of quasars is independent of the 
bias, since the only effect of the extinction is a redistribu- 
tion of the apparent magnitudes. 
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